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37. A medium according to claim 36, wherein the profile data 
includes parameters separately characterizing at least one item selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 
5 38. A medium according to claim 36, wherein the profile data 

includes parameters separately characterizing at least two items selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

39. A medium according to claim 36, wherein the profile data 
10 includes parameters separately characterizing at least three items selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 
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the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence, 

32. A method according to claim 30, wherein the profile data 
includes parameters separately characterizing at least two items selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

33. A method according to claim 30, wherein the profile data 
includes parameters separately characterizing at least three items selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

34. A digital storage medium digitally encoded with instructions, 
which when loaded into a computer establish a system for personalized 
language exercise generation, the system comprising: 

a. a language exercise database; 

b. an exercise generator coupled to the language exercise 
database, the exercise generator for selecting a language exercise from the 
language exercise database; and 

c. a leaner's profile database, coupled to the exercise generator, in 
such a marmer that the language exercise reflects parameters of a user with 
respect to whom there exists an entry in the learner's profile database. 

35. A medium according to claim 34, wherein the system further 
comprises: 

d. a scoring arrangement for scoring the results of use. of the 
exercise and for updating the learner's profile database based on the 
performance of a user in doing the exercise. 

36. A medium according to claim 35, wherein the learner's profile 
database includes profile data for a non-null set of learners, such profile data 
including parameters characterizing, for each learner in the set, such learner's 
linguistic competence. 
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parameters separately characterizing at least one item selected from the group 
including lexical competence, grammatical competence, semantic competence, 
and phonological competence. 

26. A system according to claim 24, wherein the profile data includes 
5 parameters separately characterizing at least two items selected from the 

group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

27. A system according to claim 24, wherein the profile data includes 
parameters separately characterizing at least three items selected from the 

10 group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

28- A method for generating a personalized language exercise, the 
method comprising: 

a. retrieving exercises from a language exercise database, stored 
15 in a digital storage medium; 

b. retrieving user data from a learner's profile database, stored in 
a digital storage medium, pertinent to a user; and 

c. selecting a language exercise from the language exercise 
database, based on the user data. 

20 29. A method according to claim 28, further comprising 

d. scoring the results of use of the exercise and updating the 
learner's competence profile database based on the performance of the user in 
doing the exercise. 

30. A method according to claim 29, wherein the learner's profile 
25 database includes profile data for a non-null set of learners, such profile data 

including parameters characterizing, for each learner in the set, such learner's 
linguistic competence. 

31. A method according to claim 30, wherein the profile data 
includes parameters separately characterizing at least one item selected from 
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includes parameters separately characterizing at least one item selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

20. A medium according to claim 18, wherein the profile data 
includes parameters separately characterizing at least two items selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

21. A medium according to claim 18, wherein the profile data 
includes parameters separately characterizing at least three items selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

22. A system for personalized language exercise generation, the 
system comprising: 

a. a language exercise database 

b. an exercise generator coupled to the language exercise 
database, the exercise generator for selecting a language exercise from the 
language exercise database ; and 

c. a learner's profile database, coupled to the exercise generator, 
in such a rnanner that the language exercise reflects parameters of a user with 
respect to whom there exists an entry in the learner's profile database. 

23. A system according to claim 22, further comprising: 

e. a scoring arrangement for scoring the results of use of the 
exercise and for updating the learner's profile database based on the 
performance of the user in doing the exercise. 

24. A system according to claim 23, wherein the learner's profile 
database includes profile data for a non-null set of learners, such profile data 
including parameters characterizing, for each learner in the set, such learner's 
linguistic competence. 

25. A system according to claim 24, wherein the profile data includes 
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the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

14. A method according to claim 12, wherein the profile data 
includes parameters separately characterizing at least three items selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

15. A digital storage medium digitally encoded with instructions^ 
which when loaded into a computer establish a system for personalized 
language exercise generation, the system comprising: 

a. a text input for receiving a selected text; 

b. a linguistic analyzer coupled to the text input for analyzing the 

selected text; 

c. an exercise generator coupled to the linguistic analyzer for 
generating a language exercise from the selected text; and 

d. a learner's profile database, coupled to the exercise generator, 
in such a manner that the language exercise reflects parameters of a user with 
respect to whom there exists an entry in the learner's profile database. 

16. A medium according to claim 15, wherein the linguistic analyzer 
includes a morphology analyzer and a syntax analyzer. 

17. A medium according to claim 15, wherein the system further 
comprises: 

e. a scoring arrangement for scoring the results of use of the 
exercise and for updating the learner's profile database based on the 
performance of the user in doing the exercise. 

18. A medium according to claim 15, wherein the learner's profile 
database includes profile data for a non-null set of learners, such profile data 
including parameters characterizing> for each learner in the set, such learner's 
linguistic competence. 

19. A medium according to claim 18, wherein the profile data 


25 


wo 00/42589 


PCT/IB99/02126 


7. A system according to claim 5, wherein the profile data includes 
parameters separately characterizing at least three items selected from the 
group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

8. A method for generating a personalized language exercise, the 
method comprising: 

a. receiving a selected text at a text input; 

b. linguistically analyzing the selected text to produce analytical 

data; 

c. retrieving user data from a learner's profile database, stored in 
• a digital storage medium, pertinent to a user; and 

d. generating an exercise based on the selected text, the 
analytical data, and the user data. 

9. A method according to claim 8, wherein linguistically analyzing 
the selected text includes analyzing both morphology and syntax of the text. 

10. A method according to claim 8, further comprising: 

e. scoring the results of use of the exercise and updating the 
learner's profile database based on the performance of the user in doing the 
exercise. 

11. A method according to claim 8, wherein the learner's profile 
database includes profile data for a non-null set of learners, such profile data 
including parameters characterizing, for each learner in the set, such learner's 
linguistic competence. 

12. A method according to claim 11, wherein the profile data 
includes parameters separately characterizing at least one item selected from 
the group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 

13. A method according to claim 12, wherein the profile data 
includes parameters separately characterizing at least two items selected from 
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What is claimed is: 

1. A system for personalized language exercise generation, the 

system comprising: 

a. a text input for receiving a selected text; 
5 b. a linguistic analyzer coupled to the text input for analyzing 

the selected text; 

c, an exercise generator coupled to the linguistic analyzer for 
generating a language exercise from the selected text; and 

d. a learner's profile database, coupled to the exercise generator, 
10 in such a maimer that the language exercise reflects parameters of a user with 

respect to whom there exists an entry in the learner's profile database. 

2. A system according to claim 1, wherein the linguistic analyzer 
includes a morphology analyzer and a syntax analyzer. 

3. A system according to claim 1, further comprising: 

J 5 e. a scoring arrangement for scoring the results of use of the 

exercise and for updating the learner's profile database based on the 
performance of the user in doing the exercise. 

4. A system according to claim 1, wherein the learner's profile 
database includes profile data for a non-null set of learners, such profile data 

20 including parameters characterizing, for each learner in the set, such learner's 
linguistic competence. 

5. A system according to claim 4, wherein the profile data includes 
parameters separately characterizing at least one item selected from the group 
including lexical competence, grammatical competence, semantic competence, 

25 and phonological competence. 

6. A system according to claim 5, wherein the profile data includes 
parameters separately characterizing at least two items selected from the 
group including lexical competence, grammatical competence, semantic 
competence, and phonological competence. 
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minimum competence value, the exercise is selected. 

b) For a "item-specific rule" competence item, if more than a 
minimum percentage of the selected items in the exercise are unknown, the 
exercise will be selected. 

Once the exercises are selected, the student 124 will complete the 
exercises and produce a set of results 126. The results 126 will then be used to 
update the learners competence profile 128. 

The described embodiments of the invention are intended to be merely 
exemplary and numerous variations and modifications will be apparent to 
those skilled in the art. All such variations and modifications are intended to 
be within the scope of the present invention as defined in the appended claims. 
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word groups and general exercise parameters. The exercises in the language 
exercise database 120 may have an associated competence level. The 
associated competence levels may be used to select exercises from the 
language exercise database 120 that match the learner's competence level. 
5 Accordingly, the other main component of the system is the learners' 

competence profile 128 that is a model for the learner's language proficiency, 
i.e. lexical, grammatical, semantic and phonological competencies. A 
competence agent maps and updates the learners' competence profile 128. 

Based on these two components, a teacher can generate, with the help of 
10 the system, a series of language exercises taking into account the learners' 
competence level in the target language. In a preferred embodiment, the 
language exercises are directed towards correct pronunciation, text 
comprehension, vocabulary and grammar. 

The results of each exercise completed by the learner are fed back into 
15 the learner's database for the learner in the learners' competence profile 128. 

As discussed above with respect to Figure 1, the learner's competence profile 
128 may be continuously and automatically updated with the newly obtained 
information about the learners' language knowledge. The learners 
competence profile 128 is updated in a similar marmer to that described with 
20 respect to Figures 10 and 11 above. 

An exercise generator 122 is coupled to the language exercises database 
120 and the learners' competence profile 128. The exercise generator 122 may 
select exercises from the language exercise database 120 that match the 
learners' competence level. First, the teacher selects a theme or exercise type. 
25 Next, based on the learner's profile from the learner's competence profile 128, 
a set of exercises is selected from the language exercise database 120. For 
example, exercises may be selected as follows: 

a) For an exercise on a "general rule" competence item, if the 
competence level of the learner for the competence item (i) is less than a 
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database for the selected text items. 

The results of the exercises completed by the student, as shown in Table 
110, may be used to update the learners' database for the learner in the 
learners' competence profile. The following parameters are updated for each 
entry in the learners database corresponding to the words or rules used in the 
exercises: 

a) Right or wrong answers 

b) Number of times the item has been treated 

c) Context in which the item has been treated 

d) Last time the item has been treated 

e) Lapse of time 

Table 114 in Figure 11 shows the learners database for a learner N and the 
fields updated with the results of the exercises as shown in Table 110. The 
score for an existing entry will be recalculated using the exercise results. In 
addition, new entries will be created for items that have been addressed for the 
first time by learner N. In a preferred embodiment, the new score for an item 
is a function of the previous score, whether the current answer is correct or 
incorrect, the number of correct answers for the item, the number of incorrect 
answers for the item, the context in which the item is solved and the amount of 
time taken to solve the exercise. 

Figure 12 is a block diagram of a system for generating personalized 
language exercises in accordance with an alternative embodiment of the 
invention. In accordance with the alternative embodiment, various language 
exercises are selected from a central language exercise database such that the 
selected exercises match the competence level of a learner. 

A basic component of the system of Figure 12 is a language exercise 
database 120 that contains all the data necessary to generate an exercise. In a 
preferred embodiment, this data may include a particular text, a list of selected 
words or word-groups, additional information for each of these words or 
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for exercise ger\eration. If too few or too mar\y words are selected, the 
selected text may be defined as not suitable for the competence level of the 
learner. In this case, the teacher may then select another competence item or 
another text. 

5 Once the competence items and corresponding words and/ or rules are 

selected, the teacher may set his/her preferences and create an exercise(s) with 
the selected text items. As discussed above with respect to Figure 4, there are 
numerous preferences which may be selected by the teacher. Preferably, an 
exercise includes: 
10 a) the title and description of the exercise 

b) the number of times a particular field may be filled in 

c) the time allowed to do the exercise 

d) how the fields may be replaced 

e) the evaluation per field or when the exercise is 

15 completed 

f) the type of hint and /or activation of hint 

g) error analysis yes /no 

h) drag and drop yes/ no 

i) the identification of teacher 

20 Once the exercise or exercises are generated, the exercise(s) may be 

completed by the learner or student in item 108. The results of the exercises 
completed by the student are then used to update the learners' competence 
profile in item 109. Figure 11 is a table illustrating how the learners' 
competence profile is updated. Figure 11 illustrates text items selected for 

25 exercises relating to a grammatical rule on the formation of irregular simple 
past word forms and the formation of regular simple past word forms. Table 
110 in Figure 11 shows the text items selected, the answers given by the 
student and whether the answer given was correct (1) or incorrect (0). Table 
112 in Figure 11 shows the corresponding entries in the general language 
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exercises completed by the student will be discussed with respect to Figures 10 
and 11. 

Referring to Figure 10, the teacher or the system selects a text in item 
100. As discussed above with respect to Figure 1, the text may be selected 
using a learners' interest profile such that the text selected matches the 
interests of the learner. The selected text will then be received at the text input 
and imported into the system in item 102. Once the text is imported into the 
system, it is subjected to both morphological and syntactic analysis to 
determine the linguistic information associated with the text in item 104. As 
discussed above with respect to Figure 1, the morphological and syntactic 
analysis may be performed using techniques of general use in the art. 

The linguistic data and the information in the learners competence 
profile are used to generate language exercises for the student in item 106. 
Based on the information in the learners' competence profile for a particular 
learner or group of learners, the system may select competence items from the 
competence database which have not yet been mastered by the student. In a 
preferred embodiment, if the competence level for a competence item (i) is less 
than a minimum competence value, the competence item (i) is selected as a 
candidate for exercise generation. The minimum competence value is a 
constant predetermined parameter. If the competence level for a competence 
item (i) is greater than the minimum competence value, the competence item 
(i) is defined as "known" and therefore not selected as a candidate for exercise 
generation. 

Based on the selected competence items from the competence database, 
words and phrases that are related to the selected competence items are 
selected from the text. For a general rule competence item, all words with the 
corresponding word classe(s) and extended word information are selected for 
exercise generation. For an item-specific rule competence item, all the 
unknown words or rules corresponding to the competence item are selected 
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an exercise and is known by the student. If the score is equal to zero, 
the corresponding item in the general database has not been the subject 
of an exercise completed by the student. The absolute value of the score 
is an indication of the degree of knowledge of the student with regard 
5 to the corresponding item in the general language database, (see column 

92 in Figure 9) 

b. -hanswers: the number of correct answers for the 
corresponding item in the general language database, (see column 93 in 
Figure 9) 

10 c. -answers: the number of incorrect answers for the 

corresponding item in the general language database, (see column 94 in 
Figure 2) 

d. contexts: the context in which the item was solved (correctly or 
incorrectly), (see column 95 in Figure 9) 
1 5 e. last time exercise solved: the date and time of last time the item 

was solved (correctly or incorrectly), (see column 96 in Figure 2) 
The learners competence level for a particular competence item (i) in the 
competence database may be calculated using data from the learners database. 
In a preferred embodiment, the competence level for a competence item (i) is a 
20 function of the scores (i) for all entries from the learners database which relate 
to the competence item (i). 

As a student completes an exercise(s), the learners database (see Figure 
9) is updated and a new score calculated for the item involved in the 
exercise(s). Figure 10 is a block diagram illustrating the logical flow of a 
25 method of generating personalized language exercises associated with a 

preferred embodiment of the invention. Figure 11 is a table illustrating how 
the learners' competence profile (See Figure 6) is updated using the results of 
the exercise(s) completed by the student. The process of creating exercises 
and updating the learners competence profile based on the results of the 
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database preferably includes one or more of the following fields: 

a. Type: general rule or item-specific rule (see column 82 in 

Figure 8) 

b. Description of the competence item (see column 83 in Figure 8) 

c. Word class: word classes and extended word information 
corresponding to the competence item, (see column 84 in Figure 8) 

d. Competence type: lexical, grammatical, semantic or 
phonological (see colunrn 85 in Figure 8) 

For example, competence item 5 of the competence database shown in 
Figure 8, is an item-specific rule for the formation of the irregular simple past 
form of a verb. The rule is a grammatical competence type. As mentioned 
above, the general language database classifies each of its entries as relevant to 
a particular competence item in the competence database. For example, entry 
2 in the general language database (Figure 7) is relevant to the first competence 
item in the competence database as shown in Figure 8. 

A preferred structure for the learner's database is shown in Figure 9. 
Figure 9 illustrates a profile for a particular student N. As mentioned above, 
the learners database includes a separate profile for each student (1 to N) who 
uses the system. Each entry of the learners database for a student N refers to a 
corresponding entry in the general language database (Figure 7) and indicates 
the student's score for exercises pertaining to the particular word or rule. Each 
entry in the learners database preferably includes one or more of the following 
fields: 

a. score: The score is a measure of the competence level of the 
student for this word, phrase or rule. In a preferred embodiment, the 
score is a value between -1 and 1. If the score is less than zero, then the 
corresponding item in the general database has been the subject of an 
exercise but is unknown to the student. If the score is greater than zero, 
the corresponding item in the general database has been the subject of 
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will include the dictionary base forms, a word description and the attributes of 
the word. For example, the word "people" is a noun and has the attribute of 
being uncountable. 

b) Type 2: fixed expressions. The fixed expressions are a definite 
5 set of the most frequently used expressions in the given language. 

c) Type 3: idioms 

d) Type 4: terminology 

e) Type 5: synonyms /antonyms 

f) Type 6: neologisms 
10 g) Type 7: collocations 

h) Type 8: word groups, e,g., phonological or contrastiye pairs 

i) Type 9: grammatical rules and accompanying word class and 
extended word information combination. 

As shown in Figure 7, each entry in the general language database 
1 5 preferably includes one or more of following fields: 

a. Type or classification (e.g. Type 1: single word form) (see 
column 72 in Figure 7) 

b. Word class (e.g. noun) (see column 73 in Figure 7) 

c. Value (e.g."people") (see column 74 in Figure 7) 

20 d. Level (basic, threshold, proficiency, mastery) (see column 76 in 

Figure 7) 

e. Competence item (related to competence database) (see 
column 77 in Figure 7) 

The competence item 77 refers to a competence item from the 
25 competence database 64 (Figure 6). Each entry in the general language 

database is related to the competence item designated in the competence item 
field 77. A preferred structure of the competence database is shown in Figure 
8. The competence database includes numerous competence items such as, for 
example, grammatical rules and lexical items. Each entry in the competence 
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and if error analysis should be active. As shown in Figure 1, a student or 
learner 14 will complete the exercise(s) producing a set of results 14. The 
results 14 of each exercise completed by the learner are fed back into a 
database for the learner in the learners' competence profile 15. As discussed 
above, the competence agent keeps track of information concerning the actual 
level of language knowledge and mastery and updates the learners' 
competence profile 15. In this manner, the learners' competence profile 15 
may be continuously and automatically updated with the newly obtained 
information about the learner's language knowledge. 

The learners' competence profile 15 will now be discussed in more 
detail with respect to Figures 6-11. As shown in Figure 6, the learners' 
competence profile contains a general language database 60, a learners 
database 62 and a competence database 64, The general language database 60 
includes entries for all of the known words, phrases and rules for a given 
language, including, for, example single word forms, synonyms /antonyms, 
neologisms, word groups, collocations, fixed expressions, idioms and 
grammatical rules. In a preferred embodiment, a user may also add entries to 
the general language database 60. The learners database 62 includes a separate 
database of information for each learner or student using the system. The 
learners database 62 is a specific subset of the general language database 60. 
As such, each entry in the learners database 62 for a learner N has a 
corresponding entry in the general language database 60. The competence 
database 64 includes the different types of competence items which may be 
evaluated using the exercises generated by the system. 

A preferred structure of the general language database 60 is shown in 
Figure 7. Each entry 71 refers to a type(or classification) 72 of word, phrase or 
rule. The following classifications may be utilized by the general language 
database: 

a) Type 1: single word forms. An entry for a single word form 
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word in the text. When a teacher wants to selects words manually, the teacher 
directly selects the word itself, for example, using a keyboard or a mouse. 

In the second step, the teacher can add different kinds of didactic 
information to a specific word, or group of words including hints (column 46 
5 in Figure 4), alternatives and possible feedback (column 47 in Figure 4), an 

image, a video or sound, or a link to a World Wide Web address (column 48 in 
Figure 4). A "hint" is a text, sound or image which is intended to help the 
student to find the right answer when doing the exercise. For example, when 
the student must fill in the word "computer", the text "electronic device" could 

1 0 appear on screen. 

"Alternatives" are answers from which the student can choose. An 
"alternative" can be set to be a real alternative to the right answer or an 
anticipated wrong answer. For example, when the student must fill in the 
word "has", two alternatives could appear on screen from which one has to 

15 make a choice: "has" and "have". When the student chooses the right answer, 
in this example "has", supporting information may also be provided to the 
student. This supporting information is called "feedback". The teacher may 
also select various other preferences such as adding an image, a video'^or 
sound, or a link to a World Wide Web address. The teacher may also combine 

20 steps 1 and 2 by using a "favorites" designation to generate basic predefined 
exercise types. Figure 5, is a copy of a computer display showing the user 
interface in connection with the embodiment of Figure 1. Figure 5, illustrates 
the "favorite" option available to a user. 

Besides adding data to words, the teacher may also specify global 

25 parameters which will effect the entire exercise. For example, the teacher may 
specify that all selected words must appear in a list on top of the text when 
solving the exercise, or that the infirvitive of verbs must appear in brackets. 
The teacher may also specify how many times an exercise can be solved, the 
maximum time allowed to solve an exercise, how words should be substituted. 
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Returning to Figure 1, information from the learners' competence 
profile 15^ together with information obtained from the linguistic analysis of 
the text by the linguistic analyzer 11, serve as input for the exercise generator 
12. The exercises to be generated are mainly the result of a matching process 
of text content and the content of the learners' competence profile 15. When 
the exercise generator 12 generates a new series of exercises for a learner or 
group of learners, the competence agent accesses the learners' competence 
profile 15. By screening the newly imported text against the information in the 
learners' competence profile 15, the system is able to generate exercises that 
are specifically directed to a specific learner or group of learners. 

Fig. 4 is a table setting forth, with respect to the embodiment of Fig. 1, 
user preferences 40 available in generating exercises using the selected text. 
Generating an exercise is a two-step process: 1) selecting the words to generate 
an exercise on; and 2) adding data to these words and setting global 
preferences. 

In the first step, words in the text may be selected either automatically 
or manually. When a teacher wants to select words automatically, the teacher 
may define which words to select. There are several ways in which a teacher 
can define which words for the exercise generator to select. First, the teacher 
may define a "mask" {e.g., the mask *'*ing" will select all words ending in "ing"). 
Second, the teacher may specify a word class in order to select all the words 
from the text in the specified word class {e.g., select "verb" to select all possible 
verbs). Third, the teacher may select a word class and "extended word 
information" such as singular/ plural {e.g.,^ select "noun" and " plural" to select 
all plural nouns). Fourth, the teacher may combine "mask", "word class" and 
"extended word information," whereby it is possible to take into account the 
specific learner's competence profile. Fifth, the teacher may specify a word 
list. The word list may be a pre-defined word list or a word list based on the 
learners' competence profile. Finally, the teacher may select the position of a 
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in Fig. 3 aims to offer parameters and categories that are useful for the 
description of linguistic content. The system distinguishes among lexical 
competence (column 32 in Figure 3), grammatical competence (column 34 in 
Figure 3), semantic competence (column 36 in Figure 3), and phonological 
5 competence (coliunn 38 in Figure 3). 

Lexical competence is the knowledge of^ and ability to use, the 
vocabulary of a language. Lexical competence consists of both lexical elements 
and grammatical elements as shown in column 32 of Figure 3. For example, 
lexical elements may be fixed expressions or single word forms and 

10 grammatical elements may be articles, conjunctions or prepositions. 
Grammatical competence is the knowledge of, and ability to use, the 
grammatical resources of a language. The grammar of a language may be seen 
as the set of principles governing the assembly of elements into meaningful 
labeled and bracketed strings (z.e., sentences). Grammatical competence is the 

15 ability to understand and express meaning by producing and recognizing 
well-formed phrases and sentences in accordance with these principles, in 
contrast to memorizing and reproducing them as fixed formulae. Examples of 
the different grammatical resources are shown in column 34 of Figure 3, such 
as morphemes, active/passive voice and conjugations. 

20 Semantic competence deals with the learner's awareness and control of 

the organization of meaning as shown in column 36 of Figure 3. For example, 
lexical semantics involve questions of word meaning. Grammatical semantics 
involve the meaning of grammatical elements, categories, structures and 
processes. Pragmatic semantics involve logical relations such as entailment, 

25 presupposition and implicature. Phonological competence involves a 

knowledge of, and skill in the perception and production of sound units, 
including the phonetic features which distinguish phonemes, the phonetic 
composition of words, sentence phonetics, and phonetic reduction. Examples 
of phonological competence are show^n in column 38 of Figure 3. 
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phonological competencies. By creating exercises that are directed towards a 
specific learner's abilities, the learner is not discouraged by being confronted 
with exercises which are either too difficult, too easy or not relevant. 
Accordingly, the learners' competence profile database 15, which models the 
learner's extent of mastery of the target language at any given moment, is 
coupled to an exercise generator 12. The exercise generator 12 is also coupled 
to the linguistic analyzer 11 that, as discussed above, provides linguistic data 
for the selected text. By focusing on the learner's specific interest and ability 
level while generating the learning material, preferred embodiments of the 
invention provide significantly enhanced language learning effectiveness. In 
this way the system builds a bridge between the learner's field of interest, 
exemplified by the selected text, and the learner's specific ability level, which is 
represented in the learners' competence profile 15, 

The learners' competence profile 15 contains a digital representation of 
the learners' current language proficiency, i.e. the learners' lexical, 
grammatical, semantic and phonological competence. A competence agent 
maps and updates the learners' competence profile 15. The competence agent 
updates the learners' competence profile after every relevant event, such as the 
completion of an exercise. In this manner, after every relevant learning event, 
the system adapts itself to the learner's progress. Specific parameters of the 
language exercise(s) are updated in the learners' competence profile 15. 
Parameters advantageously handled by the system are: 1) right and wrong 
answers; 2) the number of times a new word or grammatical rule has been 
treated; 3) the context in which a new word or grammatical rule has been 
treated; 4) the last time an element was treated; and 5) the lapse of time for 
solving the exercise. 

Fig. 3 is a table identifying parameter characteristics associated with the 
learners' competence profile of Fig. 1. There is at present no general theory of 
linguistic competence that commands general acceptance. The scheme shown 
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the next largest data stream is identified in item 202. Next, it is determined 
whether the identified data stream is possibly a word in item 203. If the data 
stream is not a word, processing returns to item 201. If the identified data 
stream is a word, then the stream is morphologically analyzed at item 204, 
5 The morphological analysis may be performed using techniques of general use 
in the art. During this analysis, linguistic characteristics of the word are 
identified, such as, for example, the part-of -speech of the word or whether the 
word is plural or singular. If the analysis results are recognized in item 205, 
then the data stream and its associated lexical information are added to the 

10 internal representation of the text at item 207. 

If, however, the morphological analysis results are not recognized, the 
data stream may be further analyzed as shown in items 208-211. At item 208, 
the data stream is separated into its possible components. Each component 
will be morphologically analyzed at item 209 until the end of the original data 

15 stream has been reached at item 211. 

As mentioned above, when the end of the text file is reached at item 201, 
the linguistic processing continues with the syntactic analysis as show in 
Figure 2B. The syntactic analysis may be performed using techniques of 
general use in the art. Each sentence in the text file may be analyzed at item 

20 216. The results of the syntactic analysis of a sentence are then used to refine 
the lexical analysis information of each word in the sentence at item 217. The 
process of items 217-219 are repeated for each word within the sentence until 
the end of the sentence has been reached at item 218. The syntactic analysis is 
performed for each sentence in the text file until the end of the text file is 

25 reached at item 214. 

Returning to Figure 1, the other main component utilized by the system 
to generate personalized interactive language exercises is a learner's 
competence profile 15. The learners' competence profile 15 is a model for the 
students' language proficiency, i.e. lexical, grammatical, semantic and 
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for texts that match the interest of the learner as specified in the learners 
interest profile 16. 

Once the text is selected and imported into the system, the text is 
subjected to linguistic analysis by a linguistic analyzer 11. The linguistic 
analysis is advantageously executed on the phrase level as well as the word 
level. In other words, both morphological analysis and syntactic analysis is 
performed by the linguistic analyzer 11. The linguistic data resulting from 
these analyses are one of the basic components for the system to generate 
personalized, interactive language exercises. Morphological analysis is 
conducted on the individual words in the text and syntactic analysis is 
conducted on the individual words' relations in the sentence in the text. By 
analyzing the word and the context in which it is being used, the level of 
ambiguity in the labeling process will be substantially reduced. Since the 
results of the linguistic analysis serve as input for the transformation of texts 
into exercises, this in turn will substantially enhance and refine the exercise 
generation process. 

In order to process the document provided at the text input 10, the text 
of the document is first separated from the other document components, such 
as, for example, mark-up data, pictures and annotations in the case of an 
HTML-document. Figures 2A and 2B are block diagrams illustrating the 
logical flow of the morphologic and syntactic analysis performed by the 
linguistic analyzer 11 of Figure 1. As mentioned above, once a text has been 
selected and imported, it is subjected to both morphologic and syntactic 
analysis. Figure 2A shows a preferred embodiment of the logical flow of the 
morphological analysis of the text. First, it is determined whether the end of 
the text file has been reached in item 201. If the end of the text file has been 
reached, the processing continues with the syntactic analysis of the text as 
shown in Figure 2B. 

Returning to Figure 2A, if the end of the text file has not been reached. 
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system for personalized language exercise generation in accordance with a 
preferred embodiment of the invention. Embodiments of the present 
invention may be implemented as software for use on a personal computer 
running an operating system such as Windows 95 or Windows NT, version 4, 
5 Windows 2000 Professional or the Windows 2000 Server Family available from 
Microsoft Corporation, Redmond, Washington. As used in this description 
and the following claims, a "teacher" is person who creates the exercise(s), and 
a "student" or "learner" is the person who does the exercise(s). 

Returning to Figure 1, text is imported from a digital source at the text 

10 input 10. A language learner will be especially motivated to work on text 
material that is related to his or her sphere of interests and /or activities. A 
learner's interest profile 16 is coupled to the text input 10 such that the text 
selected will match the specific interests of the learner or learners. The 
learners' interest profile 16 holds and keeps track of the learners' interests. The 

15 learners interest profile 16 may contain positive (i.e., interested in) and 

negative (i.e., not interested in or forbidden texts or sites) information. A 
learners interest agent keeps track of the learners interest profile 16 and 
updates the learners' interest profile 16 every time a document or text is 
chosen. By selecting text that matches the interests of the learner, one of the 

20 main conditions for effective learning is fulfilled, namely being interested in 
the subject-matter of the text. 

In a preferred embodiment, the selected text may be imported from the 
Internet. The Internet is an immensely rich source of information from which 
to choose text that may be used to generate language exercises. One can find 

25 for instance, at any moment of the day, interesting newspaper articles on the 

Internet and use this as a basis to generate learning material. Accordingly, in a 
preferred embodiment, a Web agent 17 is coupled to the learners' interest 
profile 16 to optimize text selection from the Internet such that the text selected 
matches the interests of the learner. The Web agent 17 may search the Internet 
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preferred embodiment of the invention. 

Fig. 4 is a table setting forth, with respect to the embodiment of Figure 
1, user preferences available in transforming a selected text into exercises. 

Fig. 5 is a copy of a screen showing the user interface in cormection with 
5 the embodiment of Fig. 1. 

Fig. 6 shows the structure of the learners' competence profile in 
accordance with a preferred embodiment of the invention. 

Fig. 7 shows the structure of the general language database of the 
learners' competence profile of Figure 7 in accordance with a preferred 
10 embodiment of the invention. 

Fig. 8 shows the structure of the competence database of the learners' 
competence database of Figure 7 in accordance with a preferred embodiment 
of the invention. 

Fig. 9 shows the structure of the learner database of the learners' 
15 competence profile of Figure 7 in accordance with a preferred embodiment of 
the invention. 

Fig. 10 is a block diagram illustrating the logical flow of a method of 
generating personalized language exercises in accordance with a preferred 
embodiment of the invention. 
20 Fig. 11 is a table illustrating how the learners' competence profile is 

updated in accordance with a preferred embodiment of the invention. 

Fig. 12 is a block diagram illustrating a system for personalized 
language exercise generation in accordance with an alternative embodiment of 
the invention. 

25 Detailed Description of Specific Embodiments 

In accordance with a preferred embodiment of the invention^ it is 
possible to instantaneously generate various language exercises that are tailor- 
made for a learner utilizing any freely chosen text or a text specifically 
matched to the learners' interests. Fig. 1 is a block diagram illustrating a 
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generator, in such a manner that the language exercise reflects parameters of a 
user with respect to whom there exists an entry in the learners' profile 
database. The exercise may be selected based on the theme of the text, the type 
of exercise or another parameter contained in the exercise. 
5 In another set of embodiments, the invention provides a digital storage 

medium digitally encoded with instructions, which when loaded into a 
computer establish any of the systems described above for personal language 
exercise generation. 

In yet another embodiment, a method for generating a personalized 
10 language exercise includes: 

1. retrieving language exercises from a language exercise 

database; 

2. retrieving user data from a learner's profile database, 
stored in a digital storage medium, pertinent to a user; and 

15 3. selecting a language exercise from the language exercise 

database based on the user data. 

Brief Description of the Drawings 
The foregoing features of the invention will be more readily understood 
by reference to the following detailed description, taken with the 
20 accompanying drawings, in which: 

Fig. 1 is a block diagram illustrating a system for personalized language 
exercise generation in accordance with a preferred embodiment of the 
invention. 

Fig. 2A is a block diagram illustrating the morphologic analysis of text 
25 in accordance with the system of Figure 1. 

Fig. 2B is a block diagram illustrating the syntactic analysis of text in 
accordance with the system of Figure 1. 

Fig. 3 is a table identifying parameter characteristics associated with the 
learners' competence profile of the system in Figure 1 in accordance with a 
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language exercise includes: 

1. receiving a selected text at a. text input; 

2. linguistically analyzing the selected text to produce 
analytical data; 

3. retrieving user data fron:\ a learner's profile database, 
stored in a digital storage medium, pertinent to a user; and 

4. generating an exercise based on the selected text, the 
analytical data, and the user data. In linguistically analyzing the selected text 
it is advantageous to analyze both the morphology and the syntax of the text. 

In a further embodiment, the method includes retrieving user interest 
data from a learner's interest database, stored in a digital storage medium, 
pertinent to a user, and selecting a text, in such a manner that the selected text 
is related to the interests of the user. 

In a further related embodiment, the method includes scoring the 
results of use of the exercise and updating the learner's profile database based 
on the performance of the user in doing the exercise. The learner's profile 
database may include profile data for a non-null set of learners. The profile 
data may include parameters characterizing, for each learner in the set, such 
learner's linguistic competence. In implementing this embodiment, the profile 
data may usefully include parameters separately characterizing one or more 
items selected from the group including lexical competence, grammatical 
competence, semantic competence, and phonological competence. 

In another set of embodiments, a system for personalized language 
exercise generation has: 

1. a language exercise database; 

2. an exercise generator coupled to the language exercise 
database, the exercise generator for selecting a language exercise from the 
database; and 

3. a learner's profile database, coupled to the exercise 
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1. a text input for receiving a selected text; 

2. a linguistic analyzer coupled to the text input for 
analyzing the selected text; 

3. an exercise generator coupled to the linguistic analyzer for 
5 generating a language exercise from the selected text; and 

4. a learner's profile database, coupled to the exercise 
generator, in such a maimer that the language exercise reflects parameters of a 
user with respect to whom there exists an entry in the learner's profile 
database. The selected text may be any text that is desired by the user or an 

10 instructor as a source from which the exercise may be generated. The 

linguistic analyzer may advantageously include a morphology analyzer and a 
syntax analyzer. In a further embodiment, the system also includes a learner's 
interest database, coupled to the text input, in such a manner that the selected 
text is related to the interests of the user. The learner's interest profile can 

15 contain positive (interested in) and negative (not interested in, or forbidden 
sites) information. In a preferred embodiment, the learner's interest profile 
may be coupled to a Web agent. The Web agent searches the Internet for 
interesting texts. 

In a further embodiment, the system also includes a scoring 

20 arrangement for scoring the results of use of the exercise and for updating the 
learner's profile database based on the performance of a user in doing the 
exercise. The learner's profile database may include profile data for a non-null 
set of learners. The profile data may include parameters characterizing, for 
each learner in the set, such learner's linguistic competence. In implementing 

25 this embodiment, the profile data may usefully include parameters separately 
characterizing one or more items selected from the group including lexical 
competence, grammatical competence, semantic competence, and phonological 
competence. 

In another embodiment a method for generating a personalized 
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certain amount of time. A particular learner will generally only repeat the 
same exercise for a limited number of times. Therefore the usage of this type of 
program is limited, both with respect to content and to lifetime. 

Another type of language learning program is an "open language" 
learning system such as the "P.A.R.A.D.I.S.E" program, available from N. V. De 
Wilde CBT, Ottergemsesteenweg 703, B-9000 Gent, Belgium, holder of rights to 
the present invention. P. A.R.A.D.I.S.E. enables a language teacher to import 
any text (hence the definition "open system") and create various exercises on 
that freely chosen text. Exercises are based on a number of static criteria such 
as word category, word derivation, position in the text, frequency list, word 
list, etc. By making a selection from the available criteria, exercises can be 
oriented towards a specific learner or group of learners. However, these 
exercises will seldom perfectly match the learners' actual, specific needs and 
will certainly not take into account the learners' progress. 

A second limitation of the existing P.A.R.A.D.LS.E. program is the way 
in which the text is linguistically analyzed. The present programs analysis is 
limited to a morphologic reduction of the text words and is thus limited to 
analysis at the word level. Analyzing the word only on its own can cause 
ambiguity because it is possible for a certain word form, considered on its 
own, to have different interpretations. For instance, one and the same word 
form can be a noun or a verb. The correct interpretation cannot be deduced 
from the morphological analysis alone. 

Compared to the above described "authoring" and "closed" systems, the 
advantage of an "open authoring system", such as the P.A.R.A.D.LS.E. 
program is that the exercises can be generated by the teacher in a short lapse of 
time, on any text and with little effort. 

Summary of the Invention 
In a first embodiment, the invention provides a system for personalized 
language exercise generation. The system has 
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Apparatus and Method for Personalized Language Exercise Generation 

Technical Field 

The present invention relates to computerized learning assistance and 
particularly to computer-related devices and methods for assisting language 
5 learning. 

Background Art 

Different computer assisted language learning programs already exist. They 
may generally be classified either as "authoring systems** or as "closed systems". An 
"authoring system" is a program in which a language teacher has the full freedom to 

10 create exercises for a specific learner or a group of learners. Such a program allows a 
learner or group of learners to work on any given topic or text as well as on various 
language skills. Generally, authoring systems are designed to be used for various 
domains in education and training such as history, geography, etc., hence, they do 
not contain linguistic features or "intelligence". Thus, authoring systems require the 

15 teacher to put in the entire content and structure of the exercises manually. Teachers 
must type in the text, define the exercise procedure, create the exercises and 
anticipate the student's answers by providing possible feedback. In other words, 
while authoring systems allow plenty of freedom in the creation of exercises, they 
also require a lot of effort from the teacher. Authoring systems are extremely 

20 laborious and time-consuming. 

A "closed system," on the other hand, reduces the time and effort required of 
the teacher. In a closed system the entire content, i.e. the texts and the exercises, are 
already contained within the system. The full content of this type of program is thus 
readily available but fixed. Consequently, the teacher and student are limited to the 

25 texts and exercises available in the system. In addition, the presented exercises are 
identical for every learner. This severely limits the possibilities of use because such a 
program may only correspond to the very specific needs of a particular learner or a 
group of learners by coincidence. Even when the exercises do correspond to the 
specific needs of a particular learner, the exercises will tend to be boring after a 
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A system for generating personalized lan- 
guage exercises includes a text input, a linguistic 
analyzer, an exercise generator and a learner's 
profile database. The linguistic analyzer may 
perform morphological and syntactic analysis on 
the input text to identify linguistic data. The 
learners interest profile contains information re- 
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generator generates language exercises using the 
linguistic data identified by the linguistic ana- 
lyzer and the linguistic competence information 
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